We introduce the "Sequential Empirical Bayes Method", an adaptive constrained-curve fitting procedure for
extracting reliable priors. These are then used in standard augmented-chi-square fits on separate data. This
better stabilizes fits to lattice QCD overlap-fermion data at very low quark mass where a priori values are not
otherwise known. We illustrate the efficacy of the method with data from overlap fermions, on a quenched $16^3 \times 28$
lattice with spatial size La = 3.2 fm and pion mass as low as ~ 180 MeV.



1. Background 

Traditionally, Monte Carlo estimates, $\langle G(t) \rangle$,
of two-point hadronic correlators have been fit to
a theoretical model, such as a sum of decaying exponentials,

$$G(t) = \sum_i w_i \, e^{-m_i t},$$

by the maximum-likelihood procedure of minimizing
the chi-square.

Fitting with a single-source multi-exponential
is usually too unstable, so the default has been the
popular single-source single-exponential, wherein
one fits only at $t > t_{\min}$ to damp contributions
from excited states. One compromises between
high statistical errors for large $t_{\min}$ and high
systematic errors for small $t_{\min}$; lattice alchemy provides
various recipes for making the compromise and
estimating the systematic errors. A multi-source
multi-exponential fit is a much better, albeit more
expensive, alternative; however, the ambiguity of
estimating systematic errors through tuning $t_{\min}$
remains.
"Constrained Curve Fitting" [1,2] offers the alternative
of minimizing an augmented chi-square,

$$\chi^2_{\rm aug} = \chi^2 + \chi^2_{\rm prior}, \qquad \chi^2_{\rm prior} = \sum_i \frac{(p_i - \tilde{p}_i)^2}{\tilde{\sigma}_i^2},$$

where $p$ denotes the collective parameters of the
fit (e.g. $p = \{w, m\}$). It achieves stability by
"guiding the fit" with the use of Bayesian priors,
that is, values of the parameters obtained from a
priori estimates $p = \tilde{p} \pm \tilde{\sigma}$. It has improved
stability; as data sets are enlarged to include small
$t$, many more terms are added in the fit model
until convergence is obtained. The $t_{\min}$ systematic
error is largely absorbed into the statistical
error. The method works well if reliable priors
are known; nevertheless, stability of the fit
results against choice of prior must be tested, and
this reintroduces an element of subjectivity.
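
As a minimal sketch of the augmented chi-square, the following assumes the commonly used Gaussian-prior form, in which a penalty of ((parameter - prior mean) / prior width)^2 per parameter is added to the ordinary chi-square. Purely for brevity it also assumes uncorrelated errors between time slices; a real analysis would use the full covariance matrix. The function names and argument layout are illustrative and not taken from the paper.

```python
import math
from typing import Sequence


def model(t: float, weights: Sequence[float], masses: Sequence[float]) -> float:
    """Multi-exponential correlator model: sum_i w_i * exp(-m_i * t)."""
    return sum(w * math.exp(-m * t) for w, m in zip(weights, masses))


def chi2_data(ts, g_mean, g_err, weights, masses) -> float:
    """Ordinary chi-square. ASSUMPTION: diagonal covariance (no correlations in t)."""
    return sum(
        ((g - model(t, weights, masses)) / err) ** 2
        for t, g, err in zip(ts, g_mean, g_err)
    )


def chi2_prior(params, prior_mean, prior_width) -> float:
    """Penalty term: squared distance of each parameter from its prior, in prior widths."""
    return sum(((p - p0) / s) ** 2 for p, p0, s in zip(params, prior_mean, prior_width))


def chi2_aug(ts, g_mean, g_err, weights, masses, prior_mean, prior_width) -> float:
    """Augmented chi-square; parameters are ordered as all weights, then all masses."""
    params = list(weights) + list(masses)
    return chi2_data(ts, g_mean, g_err, weights, masses) + chi2_prior(
        params, prior_mean, prior_width
    )
```

When a prior width is made very large, its penalty term tends to zero and that parameter is effectively unconstrained, which is how a "half-constrained" fit can be expressed with the same function.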

However, with our recent data we enter
previously unexplored territory, namely, overlap
fermions with exact chiral symmetry at unprecedentedly
small quark mass and large spatial size.
There is in general no literature from which to
obtain estimates to be used as priors and no
reliable model for estimates of level spacings.
2. Overview

We propose an adaptive self-contained constrained
curve-fitting procedure, dubbed the "Sequential
Empirical Bayes Method". We
obtain the priors gradually (from the ground state
up) as the data set is monotonically enlarged (to
include earlier and earlier time slices). The basic
"fixed $\Delta t$" algorithm is described below for
simplicity. In practice we use a refinement, the more
adaptable "variable $\Delta t$" method, results of which
are presented in the next section.

1. Choose (i) $t_{\min}$ and $t_{\max}$, the maximum
range over which the fits will be done, and
(ii) $t_{\rm start}$, the initial minimum time slice of
the fitting range.

2. Loop on various trial values ("scanning")
around central values $w_1$ and $m_1$ obtained
from the effective mass. For each, use an
unconstrained fit on the one-mass-term model
(1MTM) to fit over $\{t_{\rm start}, t_{\max}\}$ to obtain
$w_1^{(1)} \pm \sigma_{w_1}^{(1)}$ and $m_1^{(1)} \pm \sigma_{m_1}^{(1)}$. Choose as input
for the next step those values which yield
the lowest $\chi^2$.

3. Using these $w_1$, $\sigma_{w_1}$, $m_1$, $\sigma_{m_1}$ as both priors
and initial values, do a constrained-curve
fit, using the 1MTM over $\{t_{\rm start} - 1, t_{\max}\}$,
to obtain $w_1^{(2)}$, $\sigma_{w_1}^{(2)}$, $m_1^{(2)}$, $\sigma_{m_1}^{(2)}$.

4. Using these $w_1$, $\sigma_{w_1}$, $m_1$, $\sigma_{m_1}$ as both
priors and initial values, do a half-constrained
fit over $\{t_{\rm start} - 2, t_{\max}\}$ on a two-mass-term
model (2MTM); the
second mass and weight are unconstrained.
Loop on various trial values for the latter;
choose as input for the next step those
values which yield the lowest $\chi^2$.

5. Using these $w_1$, $\sigma_{w_1}$, $m_1$, $\sigma_{m_1}$, $w_2$, $\sigma_{w_2}$, $m_2$,
$\sigma_{m_2}$ as both priors and initial values, do a
fully-constrained fit, using the 2MTM over
$\{t_{\rm start} - 3, t_{\max}\}$, to obtain $w_1^{(4)}$, $\sigma_{w_1}^{(4)}$, $m_1^{(4)}$,
$\sigma_{m_1}^{(4)}$, $w_2^{(4)}$, $\sigma_{w_2}^{(4)}$, $m_2^{(4)}$, $\sigma_{m_2}^{(4)}$.

6. Repeat the last two steps until all desired
mass terms and time slices are included.
One thus obtains a complete set of priors.

3. Sample Result

The $\langle A_4 A_4 \rangle$ correlator is dominated by the
ground state of the pseudoscalar channel (pion)
over all but the few earliest time slices. With the
variable $\Delta t$ refinement of the algorithm, rather
than deciding a priori on the number of terms in
the fit and adding time slices a fixed number at
a time, one lets the data decide how many time
slices to include with each enlargement of the data
by choosing the minimum chi-square over a range
of reasonable possibilities. Since the pion
correlator is dominated by the ground state for
many time slices, many time slices are
automatically added before an attempt is made
to fit the first excited state. We find that the
variable $\Delta t$ method works very well for the pion
correlators.
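
The bookkeeping of the sequential procedure can be sketched as follows. This is only an illustration of the control flow under stated assumptions: the fits themselves are not implemented, the names `Stage`, `fixed_step_schedule` and `choose_step` are hypothetical, and `score` stands for whatever chi-square measure is compared across candidate enlargements (the text does not specify how it is normalized when the number of data points differs). The first function reproduces the fixed-step order of fits, one added time slice per stage; the second picks the step size for the variable-step refinement.

```python
from typing import Callable, Iterator, NamedTuple


class Stage(NamedTuple):
    n_terms: int  # mass terms in the fit model (1 -> 1MTM, 2 -> 2MTM, ...)
    t_lo: int     # earliest time slice in the fitting range
    n_prior: int  # how many of those terms are constrained by priors


def fixed_step_schedule(t_min: int, t_start: int, max_terms: int) -> Iterator[Stage]:
    """Yield the fit stages in order, enlarging the range by one slice per stage."""
    if t_min > t_start:
        raise ValueError("need t_min <= t_start")
    t_lo, n_terms = t_start, 1
    yield Stage(n_terms, t_lo, n_prior=0)  # unconstrained seed fit
    while t_lo > t_min:
        t_lo -= 1
        yield Stage(n_terms, t_lo, n_prior=n_terms)  # all terms constrained
        if t_lo > t_min and n_terms < max_terms:
            t_lo -= 1
            n_terms += 1
            # half-constrained fit: the newly added term has no prior yet
            yield Stage(n_terms, t_lo, n_prior=n_terms - 1)


def choose_step(score: Callable[[int], float], t_lo: int, t_min: int, max_step: int) -> int:
    """Variable-step refinement: return the number of slices to add with the lowest score."""
    largest = min(max_step, t_lo - t_min)
    if largest < 1:
        raise ValueError("no earlier time slices left to add")
    return min(range(1, largest + 1), key=score)
```

In a variable-step run, `choose_step` would replace the fixed decrement of `t_lo` at each stage, so a channel dominated by its ground state absorbs many time slices before a second term is attempted.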
Figure 1. Ground and first-excited state pseudoscalar
masses as obtained from the Sequential
Empirical Bayes Method.



4. Testing the Algorithm 

4.1. Partitioning the Configurations 

Final results should use the priors on a separate
data set, thus preserving the ideal Bayesian case.
However, empirically this seems to be unnecessary;
the output from the algorithm agrees with
the final results. We have implemented the
following test: we partition the data into two
non-intersecting sets of configurations, A and B, with
an equal number, $n_A = n_B = 40$, of configurations
in each set. Using the set A of configurations, we
perform our procedure to obtain priors; we next
use this fixed set of priors in the canonical way to
perform a constrained fit separately on data set
B, on data set A, and on the full set $A \cup B$. We
find no appreciable differences beyond expected
statistical fluctuations.
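
A minimal sketch of this partition test is given below. Two points are assumptions rather than statements from the text: configurations are assigned to A and B at random (the text does not say how the split was made), and "no appreciable difference" is read as agreement within a chosen multiple of the combined error. The function names are hypothetical.

```python
import math
import random
from typing import List, Tuple


def partition_configs(n_configs: int, rng: random.Random) -> Tuple[List[int], List[int]]:
    """Split configuration indices into two disjoint, equal-sized sets A and B."""
    indices = list(range(n_configs))
    rng.shuffle(indices)  # ASSUMPTION: random assignment to A and B
    half = n_configs // 2
    return sorted(indices[:half]), sorted(indices[half:])


def consistent(value_a: float, err_a: float, value_b: float, err_b: float,
               n_sigma: float = 1.0) -> bool:
    """ASSUMED criterion: the two fit values agree within n_sigma combined errors."""
    return abs(value_a - value_b) <= n_sigma * math.hypot(err_a, err_b)
```

Priors would be derived from set A only, and `consistent` applied to each fitted mass and weight obtained from A, from B, and from the full set.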

4.2. Stability 

The Sequential Empirical Bayes Method is
used to select the priors. Then a standard
constrained fit with all of the time slices is made.
The number of terms in the fit model is increased
until the fit results converge.
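
One way to make "until the fit results converge" operational is sketched below. The stopping rule is an assumption, not the paper's stated criterion: the result is taken as converged at the first number of terms for which the fitted value moves by no more than a chosen multiple of its own error relative to the previous fit. The function name is hypothetical.

```python
from typing import Optional, Sequence, Tuple


def first_converged(fits: Sequence[Tuple[float, float]],
                    n_sigma: float = 1.0) -> Optional[int]:
    """fits[k] is (value, error) from the fit with k + 1 terms.

    Return the smallest number of terms whose value differs from the previous
    fit by at most n_sigma * error, or None if no such fit exists.
    """
    for k in range(1, len(fits)):
        value, error = fits[k]
        previous, _ = fits[k - 1]
        if abs(value - previous) <= n_sigma * error:
            return k + 1
    return None
```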



Figure 2. Fit values for the lowest two pseudoscalar
masses (quark mass 0.18783) versus the number of
terms in the fit model, from standard constrained
fits (using the priors obtained
from the Sequential Empirical Bayes Method).
These are stable as more terms are included in
the fit model.

4.3. Reconstructing Artificial Data 

The method can successfully reconstruct the 
parameters of artificially-constructed data where 
the true results are known independently of the 
fit. 

We created a sample of artificial data as a sum
of decaying exponentials, adding independent
Gaussian noise to the function at each value
of t. The fitting procedure was able to reconstruct
the masses and weights for the ground and excited
states; the actual values were within one
measured standard deviation of the measured means.
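
A sketch of how such artificial data might be generated is shown below, using the masses and amplitudes quoted in the Figure 3 caption. The number of time slices and the noise level are not given in the text, so `n_t` and `rel_noise` are hypothetical parameters, and taking the noise width proportional to the exact value is an assumption. The effective mass uses the standard definition ln[G(t)/G(t+1)].

```python
import math
import random
from typing import List

MASSES = [0.25, 0.5, 1.0, 1.4, 1.8]           # from the Figure 3 caption
AMPLITUDES = [1.20, 1.00, 0.80, 0.60, 0.50]   # from the Figure 3 caption


def exact_correlator(t: int) -> float:
    """Noise-free sum of decaying exponentials."""
    return sum(a * math.exp(-m * t) for a, m in zip(AMPLITUDES, MASSES))


def artificial_sample(n_t: int, rel_noise: float, rng: random.Random) -> List[float]:
    """One noisy sample G(0), ..., G(n_t - 1) with independent Gaussian noise per t.

    ASSUMPTION: the noise width at each t is rel_noise times the exact value.
    """
    return [
        exact_correlator(t) + rng.gauss(0.0, rel_noise * exact_correlator(t))
        for t in range(n_t)
    ]


def effective_mass(g: List[float], t: int) -> float:
    """Standard effective mass, ln[G(t) / G(t + 1)]."""
    return math.log(g[t] / g[t + 1])
```

Averaging many such samples gives means and errors to which the fitting procedure can be applied, with the known inputs serving as the reference values.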
Figure 3. Recovery of masses from artificially-constructed
data (masses 0.25, 0.5, 1.0, 1.4, 1.8,
and amplitudes 1.20, 1.00, 0.80, 0.60, 0.50).

5. Summary 

We have introduced an adaptive self-contained
constrained curve-fitting procedure which
produces priors to be used in a standard constrained
fit on different data. One obtains the priors
sequentially as the data set is enlarged. The method's
advantages include: it is usable whenever
external reliable estimates of the priors are not
available; it can be fully automated; it reduces
human-induced bias; it decreases the frustrating
busywork of fitting; and it is self-correcting. We have
checked that the method can reconstruct artificial
data, that it is stable against adding more terms,
and that it is stable against partitioning the data.

For more complete details, see [3], which
provides additional examples of applicability and
outlines further refinements. The method has
been used successfully to make the first lattice
identification [4] of the Roper resonance at low
quark mass with exact chiral symmetry.